Error Dynamics Based Dual Heuristic Dynamic Programming for Self-Learning Flight Control
نویسندگان
چکیده
A data-driven nonlinear control approach, called error dynamics-based dual heuristic dynamic programming (ED-DHP), is proposed for air vehicle attitude control. To solve the optimal tracking problem, augmented system defined by derived dynamics and reference trajectory so that actor neural network can learn feedforward feedback terms at same time. During online self-learning process, learns policy minimizing system’s value function. The input identified recursive least square (RLS) output of critic are used to update network. In addition, total uncertainty term also RLS, which compensate caused inaccurate modeling, parameter perturbation, on. outputs ED-DHP include rough trim surface, from network, compensation. Based on this scheme, complete knowledge not needed, offline learning unnecessary. verify ability ED-DHP, two numerical experiments carried out based established morphing model. One sinusoidal signal a fixed operating point, other guidance command with process variable points. simulation results demonstrate good performance validate robustness scheme
منابع مشابه
Dual Heuristic Programming for Fuzzy Control
Overview material for the Special Session (Tuning Fuzzy Controllers Using Adaptive Critic Based Approximate Dynamic Programming) is provided. The Dual Heuristic Programming (DHP) method of Approximate Dynamic Programming is described and used to the design a fuzzy control system. DHP and related techniques have been developed in the neurocontrol context but can be equally productive when used w...
متن کاملReinforcement Control via Heuristic Dynamic Programming
Heuristic Dynamic Programming (HDP) is the simplest kind of Adaptive Critic which is a powerful form of reinforcement control 1]. It can be used to maximize or minimize any utility function, such as total energy or trajectory error, of a system over time in a noisy environment. Unlike supervised learning, adaptive critic design does not require the desired control signals be known. Instead, fee...
متن کاملElectromagnetic Formation Flight Control Using Dynamic Programming
Electromagnetic formation flight (EMFF) is an enabling technology for a number of spacecraft mission architectures. The RINGS program will be the first time EMFF is demonstrated in a microgravity environment. Nonlinearities due to magnetic field interactions preclude linear feedback controllers from being used to control the RINGS system. Approximate dynamic programming is explored in this pape...
متن کاملExtracting Dynamics Matrix of Alignment Process for a Gimbaled Inertial Navigation System Using Heuristic Dynamic Programming Method
In this paper, with the aim of estimating internal dynamics matrix of a gimbaled Inertial Navigation system (as a discrete Linear system), the discretetime Hamilton-Jacobi-Bellman (HJB) equation for optimal control has been extracted. Heuristic Dynamic Programming algorithm (HDP) for solving equation has been presented and then a neural network approximation for cost function and control input ...
متن کاملNatural Heuristic Dynamic Programming for Dynamic Systems
Heuristic Dynamic Programming (HDP) is the simplest kind of Adaptive Critic 1]. It can be used to maximize or minimize any utility function, such as total energy or trajectory error, of a system over time in a noisy environment. In this article, we propose a new version of HDP, called NHDP (Natural Heuristic Dynamic Programming). This new version incorporates basic HDP algorithm with the follow...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2022
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13010586